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ABSTRACT 

We present improved methods for using stars found in astronomical exposures to calibrate 
both star and galaxy colors as well as to adjust the instrument flat field. By developing a 
spectroscopic model for the SDSS stellar locus in color-color space, synthesizing an expected 
stellar locus, and simultaneously solving for all unknown zeropoints when fitting to the in- 
strumental locus, we increase the calibration accuracy of stellar locus matching. We also use a 
new combined technique to estimate improved flat-field models for the Subaru SuprimeCam 
camera, forming 'star-flats' based on the magnitudes of stars observed in multiple positions or 
through comparison with available measurements in the SDSS catalog. These techniques yield 
galaxy magnitudes with reliable color calibration (< 0.01 - 0.02 mag accuracy) that enable us 
to estimate photometric redshift probability distributions without spectroscopic training sam- 
ples. We test the accuracy of our photometric redshifts using spectroscopic redshifts z s for 
~5000 galaxies in 27 cluster fields with at least five bands of photometry, as well as galaxies 
in the COSMOS field, finding cr((z p - z. s )/(l + z s )) ~ 0.03 for the most probable redshift z p . 
We show that the full posterior probability distributions for the redshifts of galaxies with five- 
band photometry exhibit good agreement with redshifts estimated from thirty-band photom- 
etry in the COSMOS field. The growth of shear with increasing distance behind each galaxy 
cluster shows the expected redshift-distance relation for a flat A-CDM cosmology. Photo- 
metric redshifts and calibrated colors are used in subsequent papers to measure the masses 
of 51 galaxy clusters from their weak gravitational shear and determine improved cosmo- 
logical constraints. We make our Python code for stellar locus matching publicly available 
at http://big-macs-calibrate.googlecode.com; the code requires only input catalogs and filter 
transmission functions. 

Key words: methods: observational - techniques: photometric - galaxies: clusters - gravita- 
tional lensing: weak. 



1 INTRODUCTION 



of atmospheric conditions and at widely dispersed positions across 
the telescope focal plane, can be converted to AB magnitudes. 



A principal challenge for current and planned optical and near-IR 
wide-field surveys is to estimate accurate individual redshift prob- 
ability distributions for millions of galaxies from broadband pho- 
tometry. The reliability of photometric redshift estimates is limited 
by how accurately instrument counts, measured in a wide variety 



Here we measure calibrated galaxy colors from wide-field 
Subaru SuprimeCam and CFHT MegaCam exposures of 5 1 galaxy 
cluster fields. These galaxy colors and the photometric redshifts 
we calculate provide a basis for the estimates of cluster mass we 
make in accompanying papers. While surveys of contiguous seg- 
ments of the sky can use stars in common between adjacent expo- 
sures to calibrate imaging taken in nonphotometric observing con- 
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ditions (e.g., Ivezic et al. 2007; Padmanabhan et al. 2008; Schlafly 
et al. 2012), the galaxy cluster observations we analyze are tar- 
geted observations without overlap. To find photometric zeropoints 
for these exposures and extract spatially consistent photometry, we 
have improved calibration techniques that use the Milky Way stars 
serendipitously observed in each field. An advantage of these tech- 
niques is that they do not require dedicated calibration exposures 
of standard stars or dense stellar fields. 

For most optical telescope and camera designs, the sensitiv- 
ity to astronomical sources changes substantially across the field of 
view. For example, vignetting by the optical assembly attenuates 
the light from sources near the edge of the field of view, and detec- 
tor efficiency typically varies from pixel to pixel. To correct the spa- 
tially dependent sensitivity differences, bias-subtracted exposures 
are typically divided, pixel by pixel, by calibration exposures of a 
'flat field,' such as a dome screen or the sky. The spatial distribution 
of counts recorded by standard fiat-field exposures with wide-field 
cameras, however, can differ from the actual instrument sensitivity 
by up to ~10% across the focal plane (Manfroid et al. 2001; Koch 
et al. 2003; Magnier & Cuillandre 2004; Capak et al. 2007; Reg- 
nault et al. 2009) because of a combination of geometric distortion, 
superposed reflections, and imperfect flat-field sources. Dividing 
exposures by standard flat-field images yields object fluxes that are 
systematically fainter near the field center compared to the periph- 
ery. 

Differences among the measured magnitudes of the same stars 
observed in multiple locations on the focal plane can be used to es- 
timate this spatial variation in sensitivity that remains after dividing 
data by flat-field calibration exposures. An alternate strategy is to 
compare the measured magnitude of an object to that in a photo- 
metrically consistent catalog, such as the Sloan Digital Sky Survey 
(SDSS) catalog. We combine these two methods to measure the 
Subaru SuprimeCam 'star flat' - the map of the spatially depen- 
dent calibration error - across a decade of observations and camera 
upgrades. We then use the star fiat models to adjust the images of 
the cluster fields so that the zeropoint is consistent across the field 
of view. 

Having achieved images that are 'photometrically flat,' with 
minimal relative zeropoint differences across each image, the next 
task is to calibrate the zeropoint of each image. Overlap with sur- 
vey catalogs (e.g., the SDSS) may yield useful calibration, but the 
translation from survey magnitudes may require high-order color 
terms and stars with high S/N catalog fluxes may be saturated in 
deep science exposures. Useful standard-star observations require 
photometric conditions and repeated exposures. 

An alternative and powerful calibration strategy, requiring no 
dedicated standard-star observations, is to use the fact that most 
of the stars observed in any field lie along a well-understood one- 
dimensional locus in color-color space (e.g., High et al. 2009; Mac- 
Donald et al. 2004). According to this technique, the zeropoints of 
the filters are shifted until the position of the observed stellar lo- 
cus matches the expected locus. The resulting calibration automat- 
ically corrects for Milky Way dust extinction. The SDSS photom- 
etry catalog has been used to measure the locus of stars in color- 
color space, but the SDSS locus needs to be converted to the ex- 
pected locus for the bandpass photometry needing calibration. We 
improve the accuracy and sensitivity of this technique by correcting 
the SDSS stellar locus for extinction by dust, constructing a spec- 
troscopic model for the stellar locus, and by developing a numerical 
algorithm that fits for all unknown zeropoints simultaneously. 

The stellar locus is most sensitive to stellar metallicity varia- 
tions at wavelengths near the blue or near-UV part of the spectrum, 



and SDSS w'-band photometry, used to measure the locus, is af- 
fected by a red filter leak. We therefore adjust the zeropoints for 
the Bj band so that the magnitudes for individual galaxies match 
those extrapolated from the galaxy spectral energy distributions 
measured at longer wavelengths. 

A common technique used to improve photometric calibration 
and redshift estimates is to adjust zeropoints until the photometric 
redshifts for a training set of galaxies matches their spectroscopic 
redshifts. However, galaxy spectroscopic redshift samples are not 
available for many fields, and the galaxies in training samples are 
frequently not representative of the population of interest (e.g., the 
galaxies used to measure gravitational shear). We find that the pho- 
tometric calibration for each field from stellar locus matching is 
sufficiently accurate to estimate accurate photometric redshifts. We 
find excellent agreement between the most probable photometric 
redshift z p and the spectroscopic redshift z s , with a measurement 
error of cr(( Zp - Zs )/(1 + z,)) ~ 0.03. 

Analyses that require galaxy redshifts from photometry have 
almost exclusively used a single-point estimate for the photomet- 
ric redshift, such as the most probable redshift, rather than the 
full posterior redshift probability distribution p(z). Summarizing a 
probabilistic redshift distribution as a single value can yield 'catas- 
trophic' errors and biased redshift distributions. To measure the ac- 
curacy of the full redshift probability distribution p(z), we apply our 
calibration techniques to photometry of the COSMOS field taken 
with five Subaru SuprimeCam bands (Bj Vjr + i + z + ). We find good 
agreement between five-band p(z) distributions summed over dif- 
ferent sets of galaxies (2 gal p(z)) and the Ilbert et al. (2009) 30-band 
photometric redshift distributions for the same sets of galaxies. 

Massive galaxy clusters are expected to have an associated 
shear signal with a simple dependence on the distance to the source 
galaxy as well as the geometry of the universe (Jain & Taylor 2003; 
Taylor et al. 2007). Weak-lensing tomography behind galaxy clus- 
ters will provide a useful complement to measurements of corre- 
lations in the cosmic-shear field (which additionally depend on 
the growth of structure), and will be useful for characterizing the 
performance of shape measurement and photometric redshift algo- 
rithms as well as nondegenerate cosmological parameters. The pho- 
tometric redshift information for each galaxy allows us to measure 
the growth of weak-lensing shear with increasing distance behind 
each galaxy cluster in our sample and compare against the expec- 
tation for a fiducial flat A-CDM cosmology. 

Paper I (von der Linden et al. 2012) in this series describes 
the clusters in the sample and the data reduction methods. Paper III 
(Applegate et al. 2012) presents a Bayesian approach to measuring 
galaxy-cluster masses, that relies on the full photometric redshift 
probability distributions reported here; these estimated masses are 
compared to those measured with a standard 'color-cut' method 
based on three-filter photometry for each field. 

In this second paper in the series, we begin in Section 2 by 
briefly summarizing the sample of X-ray-selected galaxy clusters 
and the wide-field imaging. In Section 3, we describe how we de- 
termine the SuprimeCam star flats, which we use to extract consis- 
tent magnitudes across the CCD array. In Section 4, we describe 
the stellar locus calibration algorithm and the spectroscopic model 
we have developed for the stellar locus. In Section 5, we discuss 
the algorithms and the templates for galaxy spectra that we use 
to estimate photometric redshift probability distributions p(z). A 
method for finding the zeropoints of u*- and Bj- band photometry 
is presented in Section 6. In Section 7, we use the galaxy clus- 
ter red sequence and spectroscopic redshift measurements in the 
cluster fields to evaluate the accuracy of our photometric calibra- 
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tions and redshift estimates. In Section 7.3, we compare the redshift 
probability distributions determined from calibrated photometry in 
five bands (Bj Vjr + ?z + ) against both the zCOSMOS spectroscopic 
redshift sample and the most probable redshift inferred from thirty 
imaging bands in the COSMOS field. General agreement between 
the observed growth of weak lensing shear with distance behind the 
massive clusters and the A-CDM expectation is presented in Sec- 
tion 8. In Section 9, we summarize the calibration techniques and 
the quality of the photometric redshift estimates. 



2 SUBARU AND CFHT IMAGING DATA 

Our imaging data set consists of wide-field optical exposures of 
51 X-ray-luminous galaxy clusters that span the redshift interval 
0.15 < z < 0.7. The data were acquired between 2000 and 2009 
with SuprimeCam mounted on the 8.3-meter Subaru telescope and 
with MegaPrime on the 3.6-meter Canada-France-Hawaii telescope 
(CFHT). The field of view of SuprimeCam is 34' x 27' (0.2" 
pixel -1 ), while the MegaPrime has a 1° x 1° field of view (0.187" 
pixer 1 ). Each cluster field was imaged in at least three separate 
broadband filters, and 27 fields were imaged with five or more 
SuprimeCam (BjV jR c l c z + ) or MegaPrime (u'g'r'i'z*) bandpasses. 
Paper I describes the cluster sample and observations, the overscan, 
bias, and dark corrections, as well as the flat-field and superflat pro- 
cessing steps. 



3 MEASURING THE STAR FLAT 

The intensity of a pixelated image recorded by a CCD camera 
mounted on a telescope depends, in part, on the properties of the 
CCD sensors and the readout electronics, as well as the illumina- 
tion of the focal plane by the telescope optics. Correcting images 
for spatial sensitivity variations is necessary to be able to extract 
accurate fluxes for galaxies and stars. 

Calibration images of illuminated dome screens, the twilight 
sky, and the night sky vary smoothly on scales of tens of arcsec- 
onds, and these exposures of flat-field sources are useful for mea- 
suring local pixel-to-pixel sensitivity variations. The variation of 
'flat-field' images taken with a wide-field camera on several ar- 
cminute scales, however, can disagree with the camera's actual sen- 
sitivity to point sources. The discrepancy can be up to ~10% across 
the field of view of wide-field instruments (Manfroid et al. 2001; 
Koch et al. 2003; Magnier & Cuillandre 2004; Capak et al. 2007; 
Regnault et al. 2009). Dividing wide-field science images by flat- 
field exposures leads to objects near the center of the focal plane 
that are systematically fainter than sources at the periphery. 

Geometric distortion along the optical paths of many 
wide-field telescopes, such as Subaru/SuprimeCam and 
CFHT/MegaPrime, results in a decrease in pixel scale with 
increasing distance from the center of the focal plane. In Suprime- 
Cam, the pixel scale decreases by ~1.5% between the center 
and 15' from the center, with a corresponding decrease of ~3% 
in the solid angle subtended by a pixel. (See Fig. 3 in Paper I.) 
This effect means that pixels near the center of the field will 
receive proportionally greater flux than they otherwise would when 
illuminated by a hypothetical calibration source with constant flux 
per unit solid angle. The effect of variation in pixel solid angle 
due to geometric distortion can be explicitly corrected using the 
Jacobian of the astrometric distortion (e.g., Capak et al. 2007), or 



left for correction by a 'star flat' (e.g., Regnault et al. 2009), which 
is the approach we take. 

Light that scatters off the surfaces of reflective and refrac- 
tive optical elements also contributes to the spatial distribution of 
counts in flat-field images. On average, photons scattering from fil- 
ters and the CCD sensors are redirected toward the field center (e.g., 
Regnault et al. 2009). This can be seen, for example, in the halos 
that surround bright stars. Stellar halos, which are due to extra re- 
flections, are each centered on a point that is offset from the star 
towards the center of the field of view. For diffuse sources that ex- 
tend across the field of view, such as dome screens or the night sky, 
a continuum of superposed reflections accumulates near the center 
of the exposure. 

Light baffles for wide-field instruments can only be mod- 
estly effective. Capak et al. (2007) showed that the intensity of a 
SuprimeCam flat field varies by ±5% in the corners of the focal 
plane due to light scattered from outside the field of view and that 
the intensity of scattered light depends on the position of the tele- 
scope and whether exposures were taken in dark or twilight con- 
ditions. To reduce our susceptibility to these intensity variations in 
the periphery, we exclude regions of the sensors that are more than 
15' from the center of the field. 

Zeropoint variations across the focal plane that remain after 
dividing images by flat-field exposures can be measured with two 
approaches. A first technique is to image the same set of stars at 
different positions in the focal plane by dithering the telescope or 
rotating the camera, and find the spatially dependent corrections 
(the 'star flat') that result in the smallest dispersion in magnitudes 
for each star measured in different positions (Manfroid et al. 2001; 
Magnier & Cuillandre 2004; Capak et al. 2007; Padmanabhan et al. 
2008; Regnault et al. 2009; and Wittman et al. 2012). A second ap- 
proach is to compare the measured magnitudes of stars to those 
in photometrically consistent catalogs, such as the SDSS catalogs 
(Koch et al. 2003). We apply both approaches and describe the pro- 
cess in detail below. 

The CFHT ELIXIR pipeline fits and corrects for a star flat 
for MegaPrime observations (Magnier & Cuillandre 2004), so we 
measure only the SuprimeCam star flat across nine years of obser- 
vations. In addition to spatially varying zeropoint corrections, Reg- 
nault et al. (2009) find evidence for position-dependent color terms 
across the CFHT MegaPrime field of view, which the authors at- 
tribute to an angular dependence of the transmission function for 
interference filters. 



3.1 Summary of SuprimeCam Sensor and Electronics 
Upgrades 

We have grouped our 2000-2009 SuprimeCam images into succes- 
sive periods that correspond to upgrades of the CCD array (see Ta- 
ble 3 of Paper I). The CCD sensors in the early 8- and 9-chip con- 
figurations exhibited nonlinear response. We were able to correct 
the nonlinearity except for two MIT/Lincoln CCD sensors in con- 
figuration 8 and three MIT/Lincoln CCD sensors in configuration 9, 
which we discarded. The ' 10_1 ' and ' 10_2' configurations, installed 
March 27, 2001, feature ten MIT/Lincoln CCD sensors with fewer 
cosmetic defects and linear response below saturation. The upper 
left chip had lower quantum efficiency than the other CCDs, but this 
can be corrected by the flat field. The last set, the '10_3' configu- 
ration installed in July 2008, consists of ten Hamamatsu Photonics 
CCD sensors, with pixels of varying size and shape (S. Miyazaki, 
private communication). 
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3.2 Flat-Field Correction Applied to Subaru Imaging 

Before fitting for the SuprimeCam star flat, we divide each image 
by a stack of dome-flat or twilight-flat exposures taken during the 
same observing run (or adjacent runs if few fiats are available), 
and then by a night-sky flat (or 'superflat'). The night-sky flat is 
the median stack of object-subtracted exposures, already divided 
by the stacked dome or twilight flats, with no bright stars or strong 
internal reflections (see Erben et al. 2005) and typically varies by 
(0.5-1.5)% across the field of view. 

3.3 Dither Patterns and Camera Rotations 

Telescope dithers or camera rotations that move stars substantial 
distances across the focal plane are helpful to constrain spatial ze- 
ropoint variation. The 34'x27' SuprimeCam exposures that we an- 
alyze have dither patterns that generally vary by a relatively mod- 
est angle of 1' to 2'. For Subaru images that we took to measure 
the masses of the MACS (Ebeling et al. 2001; Ebeling et al. 2007; 
Ebeling et al. 2010) galaxy clusters, we rotated the camera by 90° 
to facilitate star-flat fits. Subaru images that were taken by other 
groups and included in our analysis, which we accessed through the 
Subaru-Mitaka-Okayama-Kiso Archive (SMOKA) 1 (Baba et al. 
2002), can have different dither sequences and rotations (e.g., ro- 
tations of 45° or no rotation). 

3.4 Measuring the Star Flat 

We perform a separate star-flat fit to each of 471 sets of expo- 
sures corresponding to a given cluster field, filter, and observing 
run (e.g., Abell 68, Bj, 18 July 2007). The median number of ex- 
posures we use each star-flat fit is six, with a typical relative rotation 
of 90° after the first three exposures. 

3.4. 1 Selecting the Star Sample 

To select stars, we choose objects that the SExtractor (Bertin & 
Arnouts 1996) neural network classifier, supplied with the im- 
age seeing (SEEING_FWHM), suggests are stellar-like (i.e., where 
CLASS_STAR > 0.65). We also admit only those star candidates 
for which the flux within 2.5 Kron (1980) radii (MAG^UTO) has 
less than 0. 1 mag uncertainty, as well as where the stellar image 
is unblended, has no bad or saturated pixels, has no bright neigh- 
bors, and is not truncated by a CCD sensor boundary (i.e., we in- 
clude only objects with FLAG=0). To exclude objects that are sat- 
urated or affected by detector nonlinearity in our images, we in- 
clude only measurements of objects with a maximum pixel value 
less than 25,000 ADU above the -10,000 ADU bias level, well be- 
low the full-well capacity of ~35,000 ADU above the bias level. 
Both for fitting the star flats and for measuring galaxy photometry 
and shapes for the analysis of weak lensing, we exclude objects in 
each catalog that are more than 15' from the center of the field. 
To identify and remove exceptionally discrepant magnitudes early, 
we use objects that appear in different exposures to estimate the 
relative zeropoint of each exposure, and subtract these zeropoints 
from the magnitude measured for each object. For each star can- 
didate, we remove any magnitude measurements rn^ for which 
l"Car ~ median(m)| > 1 mag, where median(ra) is the median of 
magnitudes across all exposures. 

1 http://smoka.nao.ac.jp/ 



3.4.2 Star-flat Model 

We model the SuprimeCam position-dependent zeropoint with a 
spatially varying function C(x, y, chip, rotation), which is the sum 
of a separate function f(x, y) mt for each rotation of the camera and 
a single set of chip-dependent offsets <9 c hi P , shared across the rota- 
tions and dithered exposures: 

C(x,y, chip, rotation) = f(x,y) mt +O cMp , (1) 

where f(x, y) TOt is the product of third-order Chebyshev polynomi- 
als in x and y coordinates on the focal plane. 

For the stars in each exposure that meet the sample criteria, we 
express the measured magnitude of each star, mj^, in terms of the 
spatially varying correction C(x,y, chip, rotation), and a magnitude 
m star dl:1 > which is a f ree parameter in the fit and corresponds to the 
stellar magnitude that would be measured if the exposures had no 
position-dependent zeropoint variation: 

«5 = C ( x > y. chip, rotation) + ZP exp + m™ dd , (2) 

where ZP exp is the zeropoint for each exposure. 

We also introduce constraints from available magnitudes from 
photometrically consistent catalogs. These are SDSS magnitudes 
or, alternatively, SuprimeCam or MegaPrime magnitudes corrected 
by a previous successful star-flat fit. We express each available cat- 
alog magnitude in terms of the modeled magnitude m™^, a 
zeropoint offset <9 cat , and a color term <S cat x : 

m slL = m ™M d + ^cat - <S C at X C^. (3) 

The star color c™' r is calculated from catalog magnitudes (e.g., 
g' - r'). We find the coefficient <S cat of the color term before fitting 
for the other star-flat parameters. 

The free parameters in the model are the coefficients in the 
Chebyshev polynomials in f(x,y) TOt for each camera rotation, <9 c hi p , 
O cat , ZP exp , and, for each star, m™ del . 

3.4.3 Fitting Procedure 

We construct a set of linear equations from two sets of equa- 
tions: Equation 2 for each observation of each stellar candidate, 
and Equation 3 for each star with a catalog magnitude. The matrix 
representing the system of equations where each equation corre- 
sponds to a row is sparse because only a handful of equations in- 
clude m™° di:1 for each star. Each row in the matrix is weighted by the 
inverse-square of the uncertainty in m^ r or m^. When the statis- 
tical uncertainty is less than 0.04 mag, we set the uncertainty to be 
0.04 mag for purposes of the fit, so that a small number of objects 
will not dominate the solution. We use CXSparse (Davis 2006), a 
C library for sparse matrix algebra, to apply a QR decomposition 
computed with a Householder transformation. 

After computing an inital solution, measured magnitudes m°*JJ 
that are more than 5<x from the corresponding model magnitudes 
mf°* a are removed. We then refit the remaining measured and cat- 
alog magnitudes, m^ r and m',". 

When fewer than 400 stars in the exposure have a catalog mag- 
nitude m^ r , we remove linear terms in the Chebyshev polynomials 
in f(x, y)rot because the set of exposure zeropoints <9 ch i p exhibit a 
degeneracy with the linear Chebyshev terms. Consider, for exam- 
ple, exposures taken after three successive telescope dithers to the 
north by 1'. If all stars show fluxes that diminish by 10% with each 
consecutive exposure, these changes in flux could reflect either 
worsening atmospheric transparency (i.e., ZP exp ) or a linear spa- 
tial gradient in the camera's sensitivity. The addition of a sufficient 
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Table 1. Diagnostic fit statistics averaged across acceptable star flats, sorted by chip configuration and whether SDSS stellar photometry was available for at 
least 400 matched stellar objects. The statistic (<Xj ack ) shows that the star-flat correction is well constrained. 



Chip Configuration 


SDSS Matches 


Cluster/Filter/Run Combinations 


(Cjack) 


10_3 


None 


3 


0.003 mag 


10 3 


Yes 


4 


0.003 mag 


10_1 & 1()_2 


None 


69 


0.004 mag 


10J & 10_2 


Yes 


96 


0.003 mag 


8&9 


None 


6 


0.009 mag 


8&9 


Yes 


5 


0.021 mag 




-0.04 -0.03 -0.02 -0.01 0.01 0.02 0.03 0.04 -0.04 -0.03 -0.02 -0.01 0.01 0.02 0.03 0.04 



Figure 1. The average star-flat model for the Subaru SuprimeCam Bj (left panel) and Rc (right panel) filters. The star-flat models show our estimate of the 
average, position-dependent zeropoint correction to stellar magnitudes measured from images corrected with a night-sky flat (the 'superflat'). The best-fit 
star-flat model maximizes the agreement among flux measurements of the same stars in multiple positions, as well as agreement with available magnitudes 
from photometrically consistent catalogs (i.e., from the SDSS or previously corrected SuprimeCam or MegaPrime photometry). Dividing pixelated images 
(already corrected by the night-sky flat) by the star flat yields corrected images with uniform photometry. The shaded horizontal bar below each figure shows 
the correction to the night-sky flat field in magnitudes. We restrict photometry and star-flat fits to the area within 15' of the center of the field because of 
increased levels of light scattered from outside the field of view into the periphery, strong vignetting, and changes in the point spread function at large radii. 
The model for the star flat consists of the product of third-order Chebyshev polynomials in x and y plus a chip-dependent offset, as described in Sec. 3.4.2. 



number of trij^ magnitudes from an external photometrically con- 
sistent catalog for stars spanning the field of view, however, breaks 
this degeneracy. 

3.4.4 Evaluating the Star-Flat Fit to Each Set of Exposures 

To identify star-flat solutions that have small statistical uncertainty 
and are robust to outliers, we calculate a statistic, which we call 
<7j ack , using star-flat fits to ten separate jackknife samples, each con- 
taining a randomly selected set of half the stars. The statistic, <Tj ack , 
is a measure of the variation in the star-flat solution across the sam- 
ples. 

We create a pixelated image of the best-fit star-flat model 
(in magnitudes) across the focal plane for each jackknife sample, 
where each pixel cell corresponds to an area of 20" x 20", or 
100 x 100 CCD pixels. Each jackknife correction map can then 
be represented by A"., where i and j are the pixel coordinates, 
n denotes the wth jackknife sample, and A" ; . is the mean of the 
star-flat model within pixel (;', j). We use the maps to assess 
the uncertainty of the star-flat fits. Since we are interested only 



in the relative spatially varying corrections, we subtract the me- 
dian correction across the image from the correction in each pixel: 
6A"j = A"j - median(A"). We then calculate the standard deviation 
cr(SAjj) for each pixel across the jackknife fits. 

To assess the uncertainty of the correction calculated from 
each star flat, we use the median of cr(SAjj) across all pixels in 
the image, which we call 0-j ack . 

3.4.5 Required Value of cr^ A 

Selecting a maximum acceptable value for <7j ack presents a tradeoff 
between two objectives. On the one hand, we want to measure the 
star flat for as many observing runs and pointings as possible to 
correct for any variation with position in the sky and with time. On 
the other hand, we want to apply a correction only when the statis- 
tical uncertainty on the correction is small. We adopted the criteria 
outlined in the following paragraph but, as shown in Table 1, the 
average values of the t7j ack statistic for each chip configuration are 
substantially better than these thresholds. 

For chip configurations 8 and 9, the minimal requirement for 
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Figure 2. The improvement in the agreement between SuprimeCam Vj and uniform SDSS stellar photometry in the RXJ1720. 1+2638 galaxy cluster field 
from star-flat correction. A first-order color term is used to transform SDSS g' magnitudes to expected SuprimeCam Vj magnitudes. The top panel shows 
the difference in magnitudes before dividing images (already corrected by the night-sky flat) by the best-fit star-flat model, and the bottom panel shows the 
difference in magnitudes after star-flat correction. Flux measurements of the same stars at multiple positions on the focal plane, in particular before and after 
90° camera rotations, place the strongest constraints on the star-flat model, except when fields have high stellar density and comparison magnitudes from 
photometrically consistent catalogs are available. Comparison SDSS magnitudes are important for constraining linear terms in the Chebyshev polynomials. 
(Without SDSS photometry of the field, linear terms are degenerate with variations in the zeropoint offsets between dithered exposures; in those cases, we 
exclude first-order terms from the Chebyshev polynomials.) 



using the star-flat correction is that tr jiick < 0.03 mag (i.e., star flat 
constrainted to ss 0.03 mag). For chip configurations 10_1, 10_2, 
and 10_3, where greater numbers of objects are generally available, 
we consider a fit acceptable if <Xj ack < 0.01 mag. We attempted star- 
flat fits to 471 sets, and 183 of the solutions satisfied these require- 
ments. 



A star-flat solution may be too poorly constrainted to meet the 
minimal <Xj ack requirement for several reasons. These include low 
stellar density in the galaxy cluster field, exposures taken without 
camera rotation, no overlap with the SDSS footprint, and minimal 
telescope dithers between exposures. When the star-flat fit does not 
meet the t7j ack criterion, we look for a satisfactory star flat for a 
different filter of the same field and use the corrected «i'™ dl:1 mag- 
nitudes as reference magnitudes. When even these additional 
constraints from trij^ do not yield an acceptable solution, we cor- 
rect the data by the satisfactory star flat for data taken closest in 
time (with the same filter and chip configuration). 



3.5 The Measured Star-Flat Correction 

Figure 1 shows the star-flat shape for the B j and R c bands, aver- 
aged across the fits that meet the o- jack criteria for each band. The 
star-flat corrections averaged over satisfactory fits show a maximal 
variation across the 15'-radius field of ~0.06 mag in the Bj and z + 
bands and ~0.05 mag for Vj, Rc, and I c . Within a 13'-radius field, 
the relative corrections are ~0.05 mag for Bj and z + and ~0.03-0.04 
mag for Vj, R c , and I c . Figure 2 shows how the star-flat correction 
improves the agreement between the SuprimeCam Vj and SDSS 
magnitudes for the RXJ1720.1+2638 field. 

3.6 Comparison with Capak et al. (2007) SuprimeCam Star 
Flat Analysis 

To be able to extract accurate photometry for the COSMOS project, 
Capak et al. (2007) measured a star flat for Subaru SuprimeCam 
imaging taken from 2004 through 2005 (chip configuration 10_2). 
Before fitting for the star flat, Capak et al. (2007) first normalized 
images by a dome flat and then divided the dome-flat-normalized 
photometry by the Jacobian of the astrometric solution to explic- 
itly correct for geometric distortion (~3% at a radius of 13')- By 
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contrast, we divided dome- or twilight-flat-normalized images by 
a night-sky flat, which shows variation of ~0.005 mag between 
field center and 15', and made no explicit correction for pixel size 
changes. 

The COSMOS correction, including the ~3% explicit adjust- 
ment for pixel size, agrees with our corrections to ~(l-2)% within 
13' of the center of the field. The only difference with the COSMOS 
total correction that we find is that the S ; and z + corrections may be 
-1% greater than one would expect from only the geometric effect 
within 13'. 

Since the images we analyze were taken between 2000 and 
2009, and span three major SuprimeCam upgrades, we must char- 
acterize the star-flat corrections across all configurations. Chip- 
dependent zeropoints are substantially larger for data taken during 
the early 8- and 9-chip SuprimeCam configurations because the ar- 
ray included chips from different manufacturers. 

3.7 Implementing Correct SWarp Image Resampling 

Resampling the exposures of each cluster field taken using differ- 
ent camera rotations, in separate dithered positions, and multiple 
filters to a common pixel grid and coordinate system makes it pos- 
sible to measure the magnitudes and colors of objects easily. As 
described earlier, geometric distortions through the optical paths of 
most wide-field telescopes yield a pixel scale (arcsecond pixel -1 ) 
that decreases with increasing distance from the field center. 

We find that the commonly used SWarp program (Bertin et al. 
2002, version 2.19.1) requires a patch to its source code to preserve 
relative fluxes of point sources imaged by separate CCD sensors. 
The SWarp 'VARIABLE' FSCALASTRO_TYPE setting (in con- 
trast to 'NONE' and 'FIXED') preserves the flux ratios of sources 
on the same CCD sensor during resampling, but we find that the ra- 
tios among the fluxes of objects on different CCD sensors become 
altered. 

This problem arises from the fact that, in the SWarp source 
code, the pixel flux is multiplied by a factor that depends on 
the average pixel scale of each resampled image in the 'VARI- 
ABLE' FSCALASTRO.TYPE. This chip-dependent factor yields 
chip-dependent zeropoint offsets with the current SWarp version. 
A simple modification to the SWarp source code can repair this 
problem by removing the unneeded factor for each chip. 

We discovered the problem with SWarp after all images were 
coadded, so we applied the appropriate position-dependent flux 
correction to the photometry of objects measured from resampled 
exposures. 



4 USING STELLAR LOCUS MATCHING AS A 
CALIBRATION TOOL 

The traditional approaches to calibrating photometry through 
standard-star observations or from overlap with well-calibrated sur- 
veys (e.g., SDSS) are not practical or sufficiently reliable for our 
analysis of SuprimeCam and MegaPrime imaging. A substantial 
fraction of nights have no standard-star observations and, even 
when they are available, robust calibration is possible only in the 
most favorable atmospheric conditions. Bright SDSS stars (R c < 
19 mag) saturate in our exposures, and Sloan coverage is therefore 
not useful for high precision calibration. To fit for accurate zero- 
points, we apply an improved stellar locus matching method. 

Investigations at several wavelengths show that the Galactic 
dust sheet extends only -50-100 pc from the midplane (Drimmel 



& Spergel 2001; Marshall et al. 2006; Kalberla & Kerp 2009; Jones 
et al. 201 1), and the vast majority of stars visible in SDSS, Subaru, 
and CFHT imaging are at distances beyond the edge of the Galactic 
dust sheet (e.g., High et al. 2009). Star colors lie predominantly in a 
narrow band in color-color space, called the stellar locus. The locus 
has only modest sensitivity to metallicity variations among Milky 
Way stars at wavelengths redder than -5000 A (Ivezic et al. 2007; 
High et al. 2009). Therefore, by shifting zeropoints for each band 
until the measured stellar locus matches the dereddened SDSS stel- 
lar locus, we can establish a photometric calibration that includes a 
correction for the Galactic dust. 

As an example of a typical stellar locus and our model, we 
show in Figure 3 the results of a fit for the instrumental zeropoints 
of nine SuprimeCam and MegaPrime bands for the RXJ1347-11 
galaxy-cluster field. Below, we discuss significant improvements 
to the zeropoint accuracy of stellar locus matching that can be 
achieved by first dereddening the measured SDSS stellar locus, 
then constructing a spectroscopic model for the dereddened locus, 
and finally solving for consistent zeropoints in multiple bands si- 
multaneously (e.g., BjVjR c IcZ + ). In the following section, we first 
review the astrophysical basis for using the stellar locus as a cali- 
bration tool and then describe our improved technique. 



4.1 Stellar Populations Along the Locus 

Several teams have used the stellar locus as a tool to calibrate op- 
tical photometry (MacDonald et al. 2004; Ivezic et al. 2004; High 
et al. 2009; Gilbank et al. 2011; Coupon et al. 2012). Recently, 
High et al. (2009) applied the calibration technique to g'r'i'z' Mag- 
ellan imaging of 11 galaxy clusters. For these z < 0.25 galaxy 
clusters, they fit the average color of the cluster red-sequence 
galaxies and estimated the cluster redshifts with 0.6% accuracy in 
o~((z p - z s )/(l + z s )) (see also High et al. 2010). To understand the 
effectiveness of the stellar locus technique as well as its limitations, 
they also scrutinized the astrophysical understanding of the Milky 
Way stellar locus. Here we discuss the constituents of the locus, 
their distances from the Earth, and the effect of metallicity varia- 
tion in the Milky Way. 

Blue stars are generally visible to much greater distances than 
red stars, because blue stars are predominately more luminous. As 
shown in Figure 3, the path of the locus of stars shows a sharp 
change in direction in the plot of g* - Rc versus Rc - Ic at Rc - 
l c ~ 0.6 (or SDSS r' - i' « 0.7). The part of the locus to the blue 
side of the kink includes A through K stars (members of the halo 
and disk main sequence), as well as an evolved halo population 
(High et al. 2009). The origin of the 'kink' is the fact that the disk- 
metallicity M-dwarf stars redward of the kink have substantially 
nonthermal spectra due to absorption by oxides and metal hydrides, 
including TiO, VO, CaH, and FeH (e.g., High et al. 2009; West 
et al. 2011). By contrast, low-metallicity M dwarfs have thermal 
spectra, and their colors place them along an extrapolation of the 
locus to the blue side of Rc~Ic ~ 0.6. The absence of such a second 
branch in the stellar color-color plots for the Subaru and CFHT 
images analyzed here suggests that we detect few halo M dwarfs 
in our exposures. This absence is consistent with the faint intrinsic 
luminosities of M dwarfs (10 < r < 15 mag). SDSS identified only 
a few thousand M subdwarfs with high confidence across the entire 
SDSS footprint to their 22-mag limit in the r 1 band (LSST Science 
Collaborations et al. 2009). 
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Figure 3. Color calibration of photometry through nine Subaru SuprimeCam (BjVjRcIcz + ) and CFHT MegaPrime (g'r'i'z*) filter bands of the RXJ1 347-1 1 
galaxy cluster field. We simultaneously vary eight zeropoints and hold one constant (here Rc) during the fit to maximize agreement between the instrumental 
stellar locus and the model stellar locus. The red points and lines show the model for the expected stellar locus in color-color space, and the blue points show 
the colors for the individual stars in the field after applying the best-fit photometric zeropoints. For each point along the SDSS u' g'r'i'z' stellar locus (indexed 
by g' - i' color), we construct a best-fit model spectrum that can reproduce the point's color (see Figure 4). To calibrate photometry, we compute synthetic 
magnitudes from each model spectrum and the total response functions of the filters and instrument used. These synthetic magnitudes form the model stellar 
locus that we use in each fit for unknown zeropoints shown. The arrow shows the direction and magnitude of reddening due to dust predicted by the Schlegel 
et al. (1998) dust extinction map. 



4.2 Sensitivity of Stellar Colors to Metallicity 

The stellar populations with useful photometry in an astronomical 
image will depend, at the bright end, on the exposure's saturation 
limit and, at the faint end, on the detection limit. Our SuprimeCam 
imaging generally saturates at R c ~ 19 mag and yields detections to 
Rc ~ 26 mag. For SDSS exposures, the range of useful magnitudes 
in r' is between ~14 and ~22.5 mag. 

Catalogs made from deeper exposures will include a higher 
fraction of halo members in the stellar locus to the blue side of 
the kink in the color-color plot in Figure 3(a). Although stars with 
0.2 < g - r < 0.4 are approximately evenly split between the disk 
and halo populations at SDSS depths, our Subaru and CFHT cat- 
alogs contain a higher fraction of halo stars, given the deeper lim- 
iting magnitudes. Since we use the SDSS stellar locus as a model 
for the locus in our deeper Subaru and CFHT exposures, we need 
to consider whether the properties of the stellar locus may change 
with a larger population of halo-metallicity main sequence stars. 

Using SDSS photometry and spectroscopy, Ivezic et al. (2008) 
find that the halo population has a Gaussian metallicity distribu- 
tion with mean [Fe/H] ha i « -1.5, while the metallicity of disk 
stars decreases with distance Z from the midplane according to 
[Fe/H] dlsk = (-0.78 + 0.35, exp(-|Z|, kpc" 1 )) dex. 

High et al. (2009) use model stellar atmospheres to study how 
the metallicity difference between the disk and halo main sequence 
populations affects the color of their stellar population. They assign 
a metallicity of [Fe/H]= -0.5 for the disk population and a metal- 
licity of [Fe/H]= -1.5 for the halo population. The difference in 
g' — r' color for main sequence stars in the two populations, in the 
region of the stellar locus to the blue end of the kink, is only ~0.01 



mag. However, the u' - g' colors for the disk main sequence stars 
are ~0. 1 mag redder than those for their halo counterparts. 

For this reason, we elect not to use the stellar locus to cali- 
brate near-UV bands for our 'Weighing the Giants' study of Subaru 
and CFHT imaging. Near-UV emission of stars has comparatively 
strong sensitivity to the difference between halo and disk metallici- 
ties (High et al. 2009), and the SDSS u '-band filter leak near 7 100 A 
additionally affects measurements of the stellar locus in colors that 
include the u' band. There are, however, strategies that may enable 
use of the stellar locus measured from SDSS magnitudes to cali- 
brate near-UV photometry in future efforts. 



4.3 SDSS Stellar Locus Corrected for Extinction 

We measure the stellar locus from SDSS Data Release 8 (Aihara 
et al. 201 1) photometry corrected for Galactic dust extinction. Stars 
are drawn from fields with comparatively low Galactic extinction 
(A v < 0.2 mag) and their magnitudes corrected by the Schlegel 
et al. (1998) extinction map. We place stars into bins according 
to their g' - i' color (a proxy for effective temperature) and, for a 
series of colors (e.g., g'-r', r'-f), measure the median stellar color 
within each bin following the general approach used by Covey et al. 
(2007). 

High et al. (2009) instead used the Covey et al. (2007) stellar 
locus, which is not corrected for extinction by Milky Way dust. We 
find that g' - z' locus color and photometric calibration change by 
~0.05 mag after correcting the stellar locus for Milky Way extinc- 
tion. 
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Figure 4. Best-fit stellar spectrum model for a sample point along the SDSS 
stellar locus: g' - i' as 1,6 mag. The red curve in the middle region shows 
the SDSS spectroscopic data for a K5 main sequence dwarf whose color 
best matches the color of this point on the stellar locus. The range of 
wavelengths covered by SDSS spectra (from 3900 to 9100 A) must be ex- 
tended in the blue and red regions to cover the Subaru/SuprimeCam and 
CFHT/Megaprime bandpasses that we wish to calibrate. The curves shown 
in blue correspond to the spectrum from the Pickles (1998) library with a 
shape most similar to the SDSS spectrum in the overlap region. These are 
multiplied by functions that are linear in wavelength to reproduce the full 
u'g'r'i'z' color of the point on the stellar locus, as described in the text. 



4.4 Defining Model Spectra Along the Locus 

The broadband filters used for the Subaru and CFHT observations, 
especially the SuprimeCam BjVjRcIc bandpasses, have substan- 
tially different transmission functions than SDSS u'g'r'i'z' filters. 
The color transformations are nonlinear, and we found that a sim- 
ple, linear color term yielded poor zeropoint solutions. To perform 
improved color transformations between magnitudes in different 
filter sets, we construct a comprehensive spectroscopic model for 
the SDSS stellar locus. An advantage of this spectroscopic model 
is that the locus for any filter system can be easily computed, re- 
quiring only the total response function for each filter. The total 
response function is the product of the transmission function for 
the atmosphere, reflectivity of the telescope mirrors, transmission 
function for the optics and filter, and the CCD-sensor wavelength- 
dependent response. 

For each of 75 points along the SDSS locus, indexed by g' - i' 
color, we identify the SDSS stellar spectrum (from spectroscopic 
data) whose synthetic g'r'i' colors (i.e., g' - r', r' - i', and g' - i') 
best match the g'r'i' colors for this point on the stellar locus. For 
this step, we use the CasJobs 2 tool to query the SDSS database. 
The wavelength range covered by SDSS spectra (3900 - 9100 
A) does not extend through the u' and z' bands, so we identify 
the Pickles (1998) main sequence dwarf stellar spectrum that best 



http://casjobs.sdss.org/Caslobs/ 



cross-correlates with the SDSS spectrum in the range of overlap- 
ping wavelengths. With the Pickles (1998) spectrum, we extend the 
SDSS spectrum at both the blue and red end. Figure 4 shows an 
example composite spectrum, constructed from SDSS and Pickles 
(1998) K5 main sequence dwarf spectra, for a stellar locus point 



with g'-r'sl.l mag and r 1 ■ 



■ 0.4 mag. 



We assemble a continuous spectrum by scaling the Pickles 
(1998) blue and red segments separately to match the SDSS spec- 
trum in the overlapping wavelength ranges of 4100 to 4600 A and 
8000 to 9000 A, respectively. The spliced spectrum is then com- 
prised of the Pickles (1998) blue segment up to 4200 A, the SDSS 
segment between 4200 A and 8500 A, and the Pickles (1998) red 
segment above 8500 A, as can be seen in Figure 4. We next intro- 
duce functions that are linear in wavelength that allow us to adjust 
the Pickles (1998) segments to reproduce the locus point's u'g'r'i'z! 
colors. We multiply the blue Pickles (1998) segment by a linear 
function, Fbi uc (/1) = A A + B, where Fbi„ c (4200) = 1, and then the 
red Pickles (1998) segment by a second linear function, F re(i (A), 
with F rc(i (8500) = 1. We fit for the slope of each line to match the 
u' - g' and (' - z' colors, respectively, of the point on the stellar 
locus. 



4.5 Fitting the Stellar Locus 

The objective of our fitting algorithm is to find the set of zeropoints 
that yields the best match between the observed and model stellar 
locus. We perform a search for these zeropoints using x 2 minimiza- 
tion and the downhill simplex method (Nelder & Mead 1965). 

Calculating the x 2 f° r a given model locus and set of filter 
zeropoints ZPf is a two-step process. We follow a similar strategy 
to that of High et al. (2009) but employ an improved methodology 
that enables us to fit simultaneously and self-consistently for the 
complete set of unknown zeropoints. 

For a given set of zeropoints ZPf, we search, for each ob- 
served star, through all 75 points along the model locus to find the 
point with the best^ 2 (i.e., the best match to the observed stellar 
spectral energy distribution (SED)). Keeping the zeropoints fixed, 
we repeat this process for all stars, summing an overall x 2 to cal- 
culate a goodness-of-fit (GOF) statistic. We repeat this overall pro- 
cess for each new set of zeropoints until the best set of zeropoints 
is found. 

High et al. (2009) instead solved for the zeropoints of any two 
filters (e.g., ZP g and ZP, from g — r and r - i colors) by minimizing 
the weighted, perpendicular color-distance residual: 

\d a p\ 



dir. 



(4) 



where a a is the vector of measurement errors, d„p is the vector 
distance in color space between star a and locus point B, and d a p 
is a unit vector with the same orientation as d a p. Calculating dis- 
tances in color space, however, becomes increasingly cumbersome 
for growing numbers of filters such as we have. In particular, n 
magnitude measurements produce (n 2 - n)/2 colors (e.g., 10 colors 
for 5 bands, 15 colors for 6 bands) with correlated errors. 

To improve the accuracy of the calibration, we instead employ 
a simple^ 2 method that enables simultaneous fitting for large num- 
bers of consistent zeropoints, and avoids correlated input errors that 
arise when calculating distances in color space. We follow several 
steps to measure the^ 2 0t GOF for each set of ZPf fit parameters. 
For each star a and locus point B, a common, relative zeropoint 
O a p between m"^ obs) and + ZPf is shared across all filters 

/. This relative zeropoint O a p accounts for the difference between 
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the normalization of the star's instrumental magnitudes, which de- 
pends on the star's apparent magnitude, and the arbitrary normal- 
ization of the model SED. The x^p agreement between m"'"* 5 ' and 
^(models _ ZPf is a function of Qap . 

- / mf obs) - (m™ - ZP f + O ap ) \ 2 

X^O*) = £M . (5 ) 

f=i \ e f I 

We identify the relative zeropoint 0„p that minimizes xlp(O a p) (i.e., 
maximizes the likelihood). This is simply the weighted mean of the 
differences between m a f {obs) and n$ model> - ZPf. 

Z" H (m* ,) - (rrf} moder > - ZP f ))/(e" (obs) ) 2 

- E » =1 l /(e ^))2 • W 

For each star a, we loop over the locus points to find the locus 
point B with the smallest X 2 a p{0* a p) statistic, Xa™""- We then sum 
Xa™"" over all stars a to calculate the overall GOF statistic for the 
given set of input zeropoints ZPfx\ ol = I la X^ aa *- 

To find the best-fitting zeropoints, we use the amoeba downhill 
simplex method (Nelder & Mead 1965). We solve for zeropoints in 
all filters (except one) simultaneously. This yields more robust and 
accurate zeropoints than previous implementations and improves 
the accuracy of our photometric redshifts. 

The open-source Python code is available at http://big-macs- 
calibrate.googlecode.com. Photometric calibration requires only a 
catalog of measured stellar magnitudes and the total transmission 
function for each filter. 



4.6 Testing Stellar Locus Matching Against SDSS 
Calibration and SFD Dust Extinction 

Of the 51 galaxy-cluster fields in the sample, 40 have available 
SDSS photometry. The galaxy-cluster fields in our sample span a 
wide range in RA and DEC (see Paper I), as well as Galactic lat- 
itude and longitude, and therefore the ranges of stellar metallicity 
and dust extinction are wide. We can use the available SDSS pho- 
tometry to perform a consistency check on the spectroscopic lo- 
cus model. We first correct the available SDSS stellar g'r'i'z' pho- 
tometry in each galaxy-cluster field by the Schlegel, Finkbeiner & 
Davis 1998 (SFD) extinction, and then perform stellar locus match- 
ing (SLM) on the corrected SDSS photometry. The residual color 
fesLM _ '"sLM) _ tesDss _ ''sDss) nas mea n 0.006 mag and standard de- 
viation 0.014 mag, while the residual color (r' slM - z' SLM ) - (r' &DSS - 
4dss) nas mea n -0.005 mag and standard deviation 0.019 mag. 
This excellent agreement demonstrates the effectiveness of our stel- 
lar locus matching technique. 



5 PHOTOMETRIC REDSHIFT ALGORITHMS AND 
TEMPLATES 

We estimate photometric redshifts with the Bayesian BPZ code 
(Bemtez 2000; Coe et al. 2006), which uses spectral templates 
as a model for the rest-frame SEDs of galaxies. Constraining 
the possible redshifts of galaxies based on their broadband col- 
ors (Baum 1962) is feasible because of regular features (e.g., the 
4000 A break) in rest-frame galaxy spectra. Given galaxy photome- 
try in n broadband filters, the BPZ code defines n— 1 colors C = {cf} 
relative to a 'base' filter (e.g., the band in which the galaxies were 
selected). BPZ computes the likelihood p(C\z, T) of the observed 



galaxy colors C at a discrete series of possible redshifts z (e.g., 
0.01, 0.02, 0.03, ...) for each galaxy template T. 

Bemtez (2000) introduced the use of prior probability density 
functions p(z, m, T) into galaxy photometric redshift estimation, 
which depend on galaxy redshift z, apparent magnitude m, and tem- 
plate type T. He calibrated the parameters of these prior functions 
using 737 galaxies, including -130 with spectroscopic redshifts, 
with 20 < / < 27 from the HDFN (Williams et al. 1996), and 591 
additional galaxies with spectroscopic redshifts and 20 < / < 22.5 
from the Canada-France Redshift Survey catalog (Lilly et al. 1995) 
with spectral classification from V-l colors. When the colors C are 
consistent with the SED templates at multiple redshifts, the prior 
function can alleviate or break redshift degeneracies. Here we use 
the Bemtez (2000) prior functions with a modest adjustment that 
increases the probability at low redshift (z < 0.2; see Erben et al. 
2009). The posterior redshift probability distribution is 

p(z) = p(z\C, m) cc £ p(C\z, T)p(z, m, T). (7) 

T 

5.1 Most Probable Redshift z p 

The peak of the posterior redshift probability distribution p(z) is, 
by definition, the most probable redshift of the galaxy, z p (the BPZ 
parameter BPZ_Z_B). We use BPZ to compute, from the estimated 
p(z) distribution, the probability that the galaxy redshift is within 

±0.2(l+z„) of z p , 

XZp+0.2(l+z p ) 
p(z)dz. (8) 
.,,-0.2(l+ Z;) ) 

For example, the ODDS statistic for a Gaussian p(z) centered at z p 
with standard deviation cr = 0.067(1 + z p ) would be 0.95. 

5.2 Template Set 

The performance of the BPZ code depends on the template library. 
For this study, we use the template set assembled by Capak (2004), 
which comprises elliptical, Sbc, Scd, and Im spectra from Coleman 
et al. (1980) and two starburst templates from Kinney et al. (1996). 
Capak (2004) adjusted these templates both to increase agreement 
between spectroscopically determined redshifts and the BPZ most 
probable redshift z p and to match the colors of galaxies fainter than 
the limiting magnitudes of spectroscopic samples. 

The Capak (2004) templates are ordered by type (see list 
above), and, before fitting, we generate additional templates by in- 
terpolating eight times between adjacent templates in this ordered 
list (i.e., INTER=8). 

Figure 5 shows an example photometric redshift fit to a galaxy 
with B jVjRc!cZ + magnitudes. We show the best-fit template and 
measured galaxy magnitudes, as well as the p(z) distribution and 
spectroscopic redshift value. 



6 GALAXY SED EXTRAPOLATION TO FIND u' AND S, 
ZEROPOINTS 

The photometric calibration of very blue filters, particularly u*, 
from stellar locus matching is more challenging than for redder 
bands. Absorption by metals at near-UV and blue-optical wave- 
lengths causes the u* and S ; magnitudes of main sequence stars 
to depend more strongly on the chemical abundances in the stellar 
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Figure 5. Best-fit SED model for a sample galaxy's broadband magnitudes from the BPZ photometric redshift code (left panel) and posterior redshift proba- 
bility distribution (right panel). In this example, the template with greatest posterior probability is the elliptical galaxy spectrum from Coleman et al. (1980), 
modified by Capak (2004). The red points in the left panel show, in order of increasing central wavelength, the calibrated galaxy magnitude measured for the 
SuprimeCam Bj,Vj,Rc, lc, and z + bands. The blue rectangles show the expected flux for the elliptical SED in these photometric bands. In the right panel, the 
horizontal coordinate of the vertical red line is the spectroscopic redshift of this elliptical galaxy. 



atmospheres than Vj,i? c ,/ C , and z + magnitudes (see Section 4.2). 
We have developed a technique to improve the calibration of 
Megaprime w*-band and SuprimeCam By-band observations by ex- 
trapolating, using BPZ, from the photometry at wavelengths where 
the stellar spectra are less affected by metallicity variation (e.g., 
Vj,Rc,Ic, an d z + ). We apply this technique to fields for which we 
have u* oi Bj observations in addition to observations in at least 
four other bandpasses. Our method employs a training set of 10,000 
galaxies, drawn randomly from the galaxies whose it and Bj mag- 
nitudes have uncertainties less than 0.05 mag. We modify the pho- 
tometry catalog so that Bj magnitudes are assigned an uncertainty 
of 0.2 and every u* magnitude is assigned an uncertainty of 90 mag 
(effectively removing any constraint from u + magnitudes). 

We use the BPZ photometric redshift code to calculate for 
each galaxy the expected u* and B j magnitudes using only the pho- 
tometry at redder observer-frame wavelengths, based on the best- 
fit SED template model (e.g., elliptical, Sbc) and redshift. The ze- 
ropoint corrections for the u* and Bj bands are estimated as the 
median difference pi/2 between the predicted and measured magni- 
tudes for galaxies, 

ZP„ CW - ZP M = p m (m^ - mJJJ), (9) 

although the peak as opposed to median difference could possibly 
yield improved accuracy. After calculating photometric redshifts 
for the training set of galaxies, we then reject any galaxy with BPZ 
ODDS parameter less than 0.95 and most probable photometric 
redshift smaller than z p = 0.4 so that the 4000 A break does not 
fall near the u* or Bj bandpasses. These criteria yield a sample of 
~1000 training galaxies. 

The zeropoints we estimate with this technique will of course 
depend on the accuracy of the BPZ template set used to compute 
the expected if and Bj magnitudes. This procedure generally yields 
small ~0.01 mag adjustments to the Bj-band zeropoint calibrated 



through stellar locus matching. We do not use available w*-band 
photometry for measuring cluster masses (Paper III). 



7 PHOTOMETRIC CALIBRATION TESTS 

We demonstrate the reliability of the stellar-locus-matching zero- 
point calibration, as well as the performance of the redshift estima- 
tion, by comparing our estimates for the most probable photometric 
redshift z p and the posterior probability distribution p(z) to redshifts 
derived from spectra in both our galaxy cluster fields and the Cos- 
mic Evolution Survey (COSMOS; Scoville et al. 2007) field, as 
well as accurate photometric redshifts from thirty-band photome- 
try of COSMOS galaxies (COSMOS-30; Capak et al. 2007; Ilbert 
et al. 2009). 

7.1 Spectroscopic Redshift Comparisons in the Cluster Fields 

7.1.1 Galaxy Samples in Cluster Fields 

We have measured, or compiled from the NASA Extragalactic 
Database 3 , spectroscopic redshifts for 5007 galaxies in 27 cluster 
fields with five bands of photometry. We acquired spectra for galax- 
ies in our cluster fields (e.g., Barrett 2006; Ma et al. 2008) with 
the Deep Imaging Multi-Object Spectrograph (DEIMOS) on the 
Keck-II telescope, the Low Resolution Image Spectrometer (LRIS) 
on Keck-I, the Gemini Multi-Object Spectrographs (GMOS) on 
the Gemini telescope, and the Visible Multi-Object Spectrograph 
(VIMOS) on the VLT. Of the 5007 galaxies, 4735 (95%) have 
BPZ ODDS > 0.9. In Figure 6, we plot the most probable pho- 
tometric redshift z,, versus the spectroscopic redshift z s for these 

3 http://ned.ipac.caltech.edu/ 
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Figure 6. (Plots for seven clusters are shown on next page.) Left panels show the most probable photometric redshift z p plotted against spectroscopic redshift 
z s for galaxies in fourteen cluster fields with at least five bands of photometry. Spectroscopic redshifts are principally from Keck multi-fiber spectroscopy 
of cluster fields, with additional measurements collected from the NASA Extragalactic Database (NED). We show only photometric redshift estimates for 
galaxies for which the p(z) probability distribution shows substantial weight close to the most probable redshift z, p (BPZ ODDS > 0.9). Right panels show 
histograms of Az = (z. p - Zj)/(1 + Zs)', the overlaid curves (in red) and the statistics for the mean /ja- and standard deviation o"A; correspond to Gaussian fits that 
exclude outliers with |Az| > 0. 1. 
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Figure 6 - continued 
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Figure 7. Most probable photometric redshift zp plotted against spectro- 
scopic redshift z s for 4735 galaxies with BPZ ODDS > 0.9, in cluster fields 
with at least five bands of photometry. Spectroscopic redshifts are from 
Keck, Gemini, and VLT spectroscopy of cluster fields, with additional mea- 
surements collected from the NASA Extragalactic Database. The histogram 
in the top left corner shows the distribution of Az = (z p - Z s )/(1 + Zs), the 
overlaid curve (in blue) and the statistics for the mean /i and standard devi- 
ation cr correspond to Gaussian fits that exclude outliers with |Az| > 0.1. 



Figure 8. Histogram of the most probable photometric redshift z„ for mem- 
bers of the red sequence in MACS091 1+17. The spectroscopic redshift of 
the cluster (z, = 0.51) is shown as a blue vertical line. See Sec. 7.2 for 
details on the selection of red-sequence galaxies. These plots provide a di- 
agnostic check of the photometric redshift quality available even when few 
spectroscopic redshifts are available. 



galaxies with BPZ ODDS > 0.9 in the 14 cluster fields with the 
largest numbers of available spectroscopic redshifts. The photomet- 
ric redshifts z p for each of these 14 fields have typical accuracy of 
o-((z p - z s )/(l + z,)) < 0.03 and bias (z p - z,)/(l + z s ) < 0.01, with 
\(z p — Zj)/(1 +Z S )\ > 0.1 outliers excluded. 

7.1.2 Combined Sample of Galaxies in Cluster Fields 

Figure 7 compares the most probable photometric red- 
shift z p to the spectroscopic redshift z s for the combined 
sample of galaxies with spectroscopic redshifts and BPZ 
ODDS > 0.9, across all cluster fields. The photometric red- 
shifts have accuracy of cr((z p - z,,)/(l + z,,)) = 0.036 and bias of 
<(z P - z s )/(l + z s )> = -0.001, with \z„ - z s |/(l + z s ) > 0.1 outliers 
(8.2% of galaxies) excluded. 

7.2 Photometric Redshifts of Galaxies on the Red Sequence 

Using plots of galaxy color versus magnitude for each cluster field 
(e.g., Vj - Rc versus Vj), we find the best-fit slope, intercept, and 
width of the distributions of red sequence galaxies for each cluster. 
We use these parameters to identify the galaxies along the red se- 
quence (see Paper III for details). These samples may include fore- 
ground and background galaxy populations that happen to have the 
same colors and magnitudes as the red sequence - a contamination 
that becomes increasingly significant at fainter magnitudes. Fig- 
ure 8 shows the histogram of most probable photometric redshifts 
z p with BPZ ODDS > 0.90, for galaxies along the MACS091 1 + 17 
red sequence. 

For cluster fields imaged through five or more filters (e.g., 




Spectroscopic z 



Figure 9. Peak photometric redshift z^eak, for galaxies on the red sequence 
in each cluster field, plotted against the spectroscopic cluster redshift z c i us ter- 
The redshift estimate corresponds to the center of the z p redshift bin 
(with bin width Az = 0.01) with the greatest number of red sequence mem- 
bers. Figure 8 shows an example z p histogram, for the MACS 091 1 + 17 red 
sequence. 
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BjVjR c IcZ + ), we plot in Figure 9 the peak redshift in the his- 
togram of most probable redshifts, z pi;ak , against the galaxy cluster 
spectroscopic redshift z c i uster - While we have not otherwise made 
use of available near-UV photometry when computing photomet- 
ric redshifts, we include CFHT MegaPrime u* photometry to es- 
timate Zpenk for the lowest redshift cluster in the sample, Abell 
383, since near-UV photometry, combined with blue optical pho- 
tometry, bracket the 4000 A break of z < 0.2 galaxies. The dis- 
persion of Zp Cllk is cr((z pcak - z c i uster )/(l + Zciuster)) = 0.011 and the 
bias is ((z pell k - z c iuster)/(l + Zciuster)) = -0.005. The histogram bins 
have Az = 0.01 width, which places a lower limit on the accuracy 
of the cluster redshift estimates. A more accurate estimate of the 
peak cluster redshift could be obtained by applying, for example, a 
weighted Gaussian-kernel density estimator to interpolate between 
adjacent bins and by estimating photometric redshifts with greater 
precision than Az = 0.01. 



7.3 Galaxies in the COSMOS Survey 

Accurate redshifts based on thirty-band photometry (Capak et al. 
2007; Ilbert et al. 2009) are available for galaxies to i + ss 25 mag in 
a 2 deg 2 field imaged for the Cosmic Evolution Survey (COSMOS; 
Scoville et al. 2007). Spectroscopic redshifts are also available for 
a sample of COSMOS galaxies with / < 22.5 mag (zCOSMOS; 
Lilly et al. 2009). The photometry of the COSMOS field includes 
SuprimeCam BjVjr + i + z + magnitudes, which we use to estimate 
photometric redshifts in an identical manner as for the photome- 
try of the galaxy cluster fields, after recalibrating their zeropoints. 

The magnitude limit of i + a 25 for the COSMOS catalog is 
comparable to that of our Subaru and CFHT galaxy cluster imag- 
ing, and beyond the completeness limit of today's spectroscopic 
samples. To test the five-filter photometric redshift estimates, we 
apply to the COSMOS-30 photometry catalog the same stellar lo- 
cus matching and photometric redshift estimation procedure as we 
do for the cluster fields. We compare our most probable BjVjr + i + z + 
photometric redshifts z p to zCOSMOS spectroscopic redshifts and 
COSMOS-30 photometric redshifts estimated from thirty broad, in- 
termediate, and narrow bands spanning the UV to the mid-IR (Ilbert 
et al. 2009). The accurate COSMOS-30 photometric redshifts en- 
able us to also test the posterior redshift probability distributions 
p(z) based on five bands of photometry. 



7. 3. 2 COSMOS-30 Photometric Redshifts of Galaxies with 
i + < 25 

Ilbert et al. (2009) estimated redshifts using the thirty-band pho- 
tometry available in the COSMOS survey for several hundred 
thousand galaxies to i + < 25 mag (the COSMOS-30 red- 
shifts). These authors used the LePhare code (Arnouts et al. 
1999; Ilbert et al. 2006) and rest-frame galaxy templates that in- 
clude narrow line features. The accuracy of COSMOS-30 pho- 
tometric redshifts, estimated by comparing them to the zCOS- 
MOS spectroscopic redshifts z s for the faint sample, ranges 
from cr((z COSMOS _ 30 -z s )/(l + z s ))=0.007 at i + < 22.5 mag to 
o-((zcosmos-30 - z s )/(l + z,,))=0.06 at i + « 24 mag. 

The right panel of Figure 10 compares the most prob- 
able photometric redshift z p we estimate from recalibrated 
COSMOS BjVjr + i + z + magnitudes to the COSMOS-30 pho- 
tometric redshift. The photometric redshifts have a stan- 
dard deviation of cr((z p - z C osmos-3o)/(1 + z C osmos-3o)) = 0.042 
and bias of ((z p - Zcosmos-3o)/(1 + Zcosmos-3o)> = 0.002, with 
\(z p -Zcosmos-3o)/(1 + Zcosmos-3o)I > 0.1 outliers (14.2%) ex- 
cluded. Only galaxies with BPZ ODDS > 0.9 are plotted, which 
includes 81,024 of 113,316 galaxies. 

Representing the redshift probability distribution p(z) for each 
galaxy as a single redshift (e.g., the most probable redshift z p ) ne- 
glects the sometimes substantial probability that the true galaxy 
redshift may have a very different value (e.g., Cunha et al. 2009; 
Abrahamse et al. 2011; Sheldon et al. 2011). Single-point estimates 
can lead to inaccurate conclusions, although the degree of inaccu- 
racy will vary with the question being addressed and the precise 
p(z) distribution. The p(z) distributions for the majority of COS- 
MOS galaxies with B,Vjr + i + z + photometry to i* < 25 cannot be 
parameterized with a simple function (e.g., the sum of two Gaus- 
sian distributions). 

The COSMOS data allow us to assess the accuracy of the 
BjVjr + i + z + posterior redshift probability distributions p(z). We 
partition galaxies from the COSMOS-30 catalog into bins accord- 
ing to most probable redshift z p , with bin widths of 0.2. For the 
galaxies in each redshift bin, we calculate the stacked p(z) distribu- 
tion £ ; p(z) from our BjVjr + i + z + data. In Figure 11, we compare 
the stacked distribution from BjVjr + i + z + photometry to the redshift 
distribution for thirty-band Ilbert et al. (2009) COSMOS-30 photo- 
metric redshifts for galaxies with i* < 25 mag and with z p in four 
redshift ranges. We include all galaxies with BPZ ODDS > 0.5. 
The comparisons show agreement between the p(z) distribution we 
estimate and the Ilbert et al. (2009) COSMOS-30 estimate, which 
uses substantially more complete photometry. 



7.3. 1 zCOSMOS Spectrosopic Redshifts with I < 22.5 

Lilly et al. (2009) measured spectroscopic redshifts for galaxies 
in the COSMOS field and made publicly available the spectro- 
scopic redshifts for targets with / < 22.5 (the zCOSMOS bright 
sample). The left panel of Figure 10 shows the most probable 
photometric redshift z p based on BjVjr + i + z + photometry recali- 
brated with stellar locus matching for the COSMOS field versus 
the zCOSMOS spectroscopic redshifts z„ for galaxies with BPZ 
ODDS > 0.9. The histogram in the upper left corner of the panel 
shows the level of agreement between z p and z s . The photometric 
redshifts have accuracy of cr((z p - z,,)/(l + z.,)) = 0.040 and bias 
of ((z„ - z s )/(l + z s )> = -0.005, with \ Zp - z s |/(l + z s ) > 0.1 out- 
liers (10.5% of galaxies) excluded. Only the galaxies with BPZ 
ODDS > 0.9 are plotted, which includes 6842 of 7959 galaxies. 



8 TOMOGRAPHIC WEAK LENSING 

Correlations in the cosmic shear field hold significant promise as 
a tool for investigating dark energy (e.g., Hoekstra et al. 2006; 
Massey et al. 2007b; Fu et al. 2008; Schrabback et al. 2010). How- 
ever, accessing the statistical power available from upcoming sur- 
veys (e.g., PanSTARRS, Dark Energy Survey, SkyMapper, Large 
Synoptic Sky Survey) to constrain cosmological parameters will 
require a complete characterization of the biases of both shear mea- 
surements and photometric redshift algorithms. The massive galaxy 
clusters in our sample provide ideal proving grounds for these tech- 
niques because the deep gravitational potentials of the galaxy clus- 
ters should yield strong tomographic shear signals. 

The growth in shear with increasing redshift of a galaxy be- 
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ZCOSMOS Spectroscopic z COSMOS-30 Photometric z 



Figure 10. The left panel shows the most probable redshift z p calculated from five Subaru bands (BjVjr + i + z + ) in the COSMOS catalog plotted against the 
zCOSMOS spectroscopic redshifts for 6842 galaxies. In the right panel, z„ is plotted against the COSMOS-30 photometric redshifts for 81,024 galaxies. The 
five-band z p values are computed from the COSMOS photometry recalibrated using stellar locus matching. We include a galaxy in the plot only if BPZ ODDS 
> 0.9. The histogram in the top left corner of each plot shows the distribution of Az = (z p - zcosmos)/(1 + zcosmos); the overlaid curve (in blue) and the 
statistics for the mean /j and standard deviation cr correspond to Gaussian fits that exclude outliers with |Az| > 0.1. 



hind a gravitational lens (z s ) is sensitive to cosmological parame- 
ters through its dependence on the ratios of angular diameter dis- 
tances (Jain & Taylor 2003; Taylor et al. 2007). The tangential shear 
at a radius r from the cluster center is given by 



7t(r,zs) 



S« r) - Kr) 



(10) 



where E(< r) is the mean surface mass density inside radius r and 
£(r) is the azimuthally averaged surface mass density at radius r. 
S c is the critical surface mass density, 



D s 



(11) 



AnG D l D ls ' 

where Ds, D L , and D L g are the angular diameter distances from 
the observer to the lensed source galaxy, from the observer to the 
lens (the cluster), and from the lens to the source galaxy, respec- 
tively. The dependence of the weak gravitational shear on angular 
diameter distances is therefore 

DlsDi 



7t • 



Ds 



(12) 



Since D L is the same for all sources lensed by a given galaxy clus- 
ter, the dependence of shear on the source redshift is simply 

D LS 

7t -p— = P(.z). 

Shape measurements yield an estimate for the reduced shear g 
rather than the shear y; the two are related by 

. ft 7- 



where 



1 - ft*. 



(13) 



(14) 



Ka, and y x are the convergence and shear, respectively, of a lensed 
source at z = °°. 

To estimate the actual reduced shear g from the measured 
shear g, we use a linear shear correction with slope m(r h ) and inter- 
cept c measured from STEP2 simulations (Massey et al. 2007a): 



(15) 



1 + m(r h ) ' 

where m(r h ) is a function of the object size r h relative to the PSF 
size of the observation (see Papers I and III). 

Taylor et al. (2007) projected the constraints on cosmologi- 
cal parameters that would be possible from a five-band optical sur- 
vey of galaxy clusters with masses M ss 10 I5 M G . For a survey of 
fifty of the most massive low- to moderate-redshift galaxy clusters 
with five-band photometry and a median source galaxy redshift of 
z = 0.7, similar to this paper's sample, Taylor et al. (2007) forecast 
that the dark-energy equation-of-state parameter w may be con- 
strained to within ±0.6, after combination with a WMAP prior. This 
ancipated precision is for models that allow curvature and evolving 
w, and improved constraints will therefore be possible for a flat cos- 
mology with a static value for w. For a five-band survey covering 
10000 deg 2 with median source galaxy redshift of z = 0.7, Taylor 
et al. (2007) projected a constraint of Aw = 0.0075. Taylor et al. 
(2007) made the assumption that the errors of photometric red- 
shifts are Gaussian and that any magnitude- or size-dependent bias 
in ground-based shear measurements can be controlled to z = 1.5. 

In a number of pioneering efforts, an increase in shear signal 
has been been observed with distance of the source galaxies be- 
hind groups or clusters. Wittman et al. (2001) plotted the measured 
shear against photometric redshift for the galaxies lensed by a mas- 
sive cluster at z = 0.276 (see also Wittman et al. 2003). Kitching 
et al. (2007) combined a tomographic analysis of the A901/2 su- 
percluster, with a cosmic-shear analysis of two randomly selected 
fields using COMBO-17 photometric redshift estimates. Medezin- 
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Figure 11. Stacked p(z) distribution, 2 go / P(z), from BjVjr + i + z + photometry (open histogram) compared with the redshift distribution for thirty-band Ilbert 
et al. (2009) COSMOS-30 photometric redshifts (red histogram) for galaxies with ;' + < 24.5 mag and with z p in four redshift ranges: 0.4 < Zp < 0.6, 
0.6 < z p < 0.8, 0.8 < z. p < 1, and 1 < z p < 1.2. We include all galaxies with BPZ ODDS > 0.5. The comparisons show good agreement between the 
p(z) distribution from our analysis and the Ilbert et al. (2009) estimates that use substantially more complete photometry. We use the more informative p{z) 
distribution, rather than the most probable redshift z p to measure galaxy-cluster masses. As discussed in Paper III, this leads to significantly improved accuracy 
in our weak-lensing mass results. 



ski et al. (2011) showed that the shear signal increases with dis- 
tance of sources behind the A370, ZwC10024+17, and RXJ1347- 
1 1 galaxy clusters, based on color cuts applied to three-filter Subaru 
imaging. Taylor et al. (2011) measured the redshift dependence of 
the weak lensing shear behind groups in the COSMOS field using 
HST images for shape measurements and the Ilbert et al. (2009) 
thirty-band photometric redshifts. 



8.1 Tomographic Signal Behind Individual Massive Clusters 

We first examine how, for individual clusters, the variation of mea- 
sured shear with source redshift compares with the prediction for a 
fiducial A-CDM cosmology (n M =0.3, £2 A =0.7, w=-l, h=0J). 

To determine the shear signal as a function of redshift in each 
field, we bin galaxies with BPZ ODDS > 0.5 according to their 
most probable redshift z p . We fit a singular isothermal shear (SIS) 
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Figure 12. The measured weak-lensing shear as a function of most probable redshift z„ for source galaxies in eight different massive cluster fields, with 
photometric redshifts measured with at least five filters. The shear y t (z) is normalized to the expected asymptotic value y t (z = oo). The points show the measured 
amplitude of the tangential shear profile, assuming a singular isothermal model for the cluster mass-density distributions. The solid curve in each plot shows 
the cosmological-distance-redshift relation for sources behind the cluster for our reference A-CDM cosmology (£2 M =0.3, £i\=0.1, w=-l, h=0J) normalized 
to fit the shear amplitude. The error bars on the measured values are the 68% uncertainties estimated from bootstrapped galaxy samples. The measured shear 
is based on galaxies for which BPZ ODDS > 0.5. 
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profile 



y,(r, z e zd ■ 



A, 



(16) 



to the tangential shear y t of the galaxies in the ith redshift bin z,, 
where r is the distance to the center of the galaxy cluster. We divide 
each amplitude A, by the asymptotic value A x at z = oo expected 
for the fiducial A-CDM cosmology: 

y,(z e zd _ Aj_ 

A ' 



(17) 



y,(oo) 

Figure 12 shows the variation of y,(z e Zi)ly{°°) versus mean red- 
shift for eight cluster fields with excellent filter coverage (Suprime- 
Cam BjVjRcIcZ + and, in some cases, also MegaPrime u'g'r'i'z*)- 
The error bars are uncertainties we estimate from bootstrapped 
samples of galaxies. These comparisons show good agreement be- 
tween the observed variation in y,(z)/y(°o) with redshift and the 
expectation for the fiducial cosmology. 

8.2 Tomographic Signal From a Stack of 27 Massive Clusters 

For a cosmology with zero spatial curvature (i.e., a 'flat' universe), 
the angular diameter distance D(z) is equal to the product of the 
comoving distance tj z and the scale factor a z , 



D(z) = cj z x a z 



(18) 



Taylor et al. (2011) used this relation to stack the measured shear 
as a function of distance for multiple lenses. 

For a flat geometry, Equation 12 simplifies to 

<j) L (u) S - col) 



y,(z) oc 



(19) 



cos (I +Zl) 

where tog and cj l are comoving distances from the observer to the 
source and lens, respectively, and zl is the redshift of the lens. 
Defining the ratio x of the two comoving distances ll>s and a> L as 

x = U) s /to L , (20) 

we can express Equation 19 in terms of x: 

y,(x) oc -^(1 - -). (21) 
1 + Zl x 

Therefore, in a flat universe, we can express the tangential shear 
ratio y,{x)ly,{x = oo) simply in terms of x: 



yM =1 _ l 

y t (x = oo) x 



(22) 



We note that y,(x = oo) is not physical for the reference cosmol- 
ogy, because the ratio of comoving distances x remains finite as the 
source redshift z approaches infinity. 

When estimating y,(x)ly,(x = oo) for each galaxy, we divide 
the measured shear by the shear expected for a source galaxy at 
x = oo given the best-fit mass profile for the cluster (Paper III). We 
place the galaxies in the 27 cluster fields with photometry through 
at least the BjVjR c IcZ + bands (see Table 1 of Paper I) into bins ac- 
cording to x. The value of y t (x)ly,(x = oo) for the sample of galax- 
ies in each redshift bin is then estimated with a weighted Gaussian- 
kernel density estimator. Only the galaxies that are used to estimate 
cluster masses in Paper III are included; see that paper for a detailed 
description of the galaxy selection criteria. Galaxies are included 
only if the most probable photometric redshift is z p < 1.25. 

In Figure 13, the points correspond to the stacked shear signal 
calculated in the reference cosmology, and the error bars are esti- 
mated from bootstrapped galaxy samples at each redshift. The data 




Figure 13. The stacked shear ratio for 27 clusters as a function of scaled 
comoving distance. The horizontal axis is the ratio of comoving distances 
from the observer to the source galaxy and to the lens. The vertical axis 
is the shear signal divided by that for a hypothetical (although physically 
disallowed) source at .v = oo calculated for the fiducial cosmology and the 
best-fit cluster mass (from Paper III). The black curve is the predicted shear 
scaling 1 - \ jx for the reference cosmology. Points are the modes (maxi- 
mum likelihood) measured from a Gaussian-kernel density estimator, and 
the error bars represent the 68% confidence interval as estimated from boot- 
strapped samples of galaxies in each bin. 



show good qualitative agreement with the variation expected for 
the reference cosmology. We defer to a follow-up paper the deter- 
mination of cosmological constraints from the tomographic lens- 
ing signal, using the entire redshift probability distribution p(z) for 
each galaxy. 



9 CONCLUSIONS 

Converting the signal measured by CCD sensors mounted on a 
wide-field camera to calibrated AB magnitudes with better than 
several percent accuracy has typically required time-consuming 
calibration exposures as well as optimal observing conditions. 
When useful overlap with survey photometry is not available (e.g., 
limited dynamic range, color transformations from survey magni- 
tudes), repeated standard-star observations in photometric condi- 
tions are necessary for accurate zeropoint calibration. Dedicated 
exposures sequences of dense stellar fields have been consid- 
ered necessary to measure the position-dependent zeropoint er- 
ror (~0.04-0.06 mag for SuprimeCam) present after correcting by 
dome or sky-flat calibration exposures. 

For the Subaru and CFHT imaging that we analyzed, ded- 
icated calibration data were not always available and conditions 
were not always photometric. We have developed improved tools 
that use only the instrumental stellar magnitudes in exposures of 
science targets to produce accurate star-flat models and color cal- 
ibrations. These have sufficient power to enable high-quality pho- 
tometric redshift estimates without spectroscopic training sets. We 
use 4735 galaxy spectra in 27 cluster fields across the sky to mea- 
sure an accuracy of cr((z p - z s )/(l + z s )) « 0.03. 

We achieve < 0.01 - 0.02 mag color calibration by match- 
ing the observed stellar locus in color-color space to a model stel- 
lar locus. We developed a spectroscopic model for the dereddened 
SDSS stellar locus to estimate more accurately the locus expected 
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for SuprimeCam and MegaPrime filters. This step is important be- 
cause nonlinear color transformations exist between our Suprime- 
Cam BjVjRcIcZ + magnitudes and the SDSS u'g'r'i'z' filters used 
to measure the stellar locus. We created a simple x 1 GOF statistic 
that makes possible a solution for all unknown zeropoints simul- 
taneously, without correlated errors. These improvements, includ- 
ing the correction of the SDSS stellar locus for Galactic extinc- 
tion, were necessary to estimate robust and reliable photometric 
redshifts from our Subaru and CFHT imaging. Interested readers 
may find the Python code as well as a spectroscopic model for the 
locus at http://big-macs-calibrate.googlecode.com. Calibration re- 
quires only a catalog of stellar magnitudes and the total transmis- 
sion function for each bandpass filter. 

Summarizing the redshift posterior probability distribution 
p(z) for each galaxy as a single redshift (e.g., the most probable 
redshift z p ) disregards any probability that the true galaxy redshift 
may have a very different value (e.g., catastrophic outliers). The 
p(z) distribution contains more information than the most proba- 
ble redshift, or other single-point estimates. We find good agree- 
ment between the stacked Y* ga i P(z) and the redshift distribution es- 
timated from thirty COSMOS photometric bands by Ilbert et al. 
(2009). The p(z) distributions will provide a more powerful, less 
biased tool for science analysis than single-point estimates for on- 
going and future surveys (e.g., PanSTARRS, Dark Energy Survey, 
SkyMapper, Large Synoptic Sky Survey). 

The extreme gravitational potentials of massive galaxy clus- 
ters are expected to yield a simple, strong growth in shear signal 
with increasing distance of the source galaxy behind the cluster 
lens. The shape of the growth of shear behind a stack of 27 clusters 
with increasing redshift shows agreement with a fiducial A-CDM 
cosmology. The average shear we measure for foreground galaxies 
is consistent with zero. 

The software we make available can produce state-of-the-art 
photometric accuracy without dedicated observations or the need to 
estimate nonlinear color terms. Stellar colors will be a useful tool 
for checking the accuracy of zeropoints obtained through ubercal- 
ibration of surveys (e.g., Ivezic et al. 2007; Padmanabhan et al. 
2008; Schlafly et al. 2012) or close monitoring of atmospheric 
transparency (e.g., Burke et al. 2010; Blake & Shaw 201 1). 
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